Extraction and Visualization of Trend Information from Newspaper Articles and Blogs

نویسندگان

  • Hidetsugu Nanba
  • Nao Okuda
  • Manabu Okumura
چکیده

Trend information is a summarization of temporal statistical data, such as changes in product prices and sales. We propose a method for extracting trend information from multiple newspaper articles and blogs, and visualizing the information as graphs. As target texts for extraction of trend information, the MuST (Multimodal Summarization for Trend Information) workshop focuses on newspaper articles. In addition to newspapers, we focus on blogs, because useful information for analysing trend information is often written in blogs, such as the reasons for increases/decreases of statistics and the impact of increases/decreases of statistics on society. To extract trend information, we extract temporal expressions and statistical values, and we devised methods for both operations. To investigate the effectiveness of our methods, we conducted some experiments. We obtained a recall of 6.3% and precision of 31.3% for newspaper articles, and a recall of 44.8% and precision of 60.3% for blogs. From the error analysis, we found that most errors in newspaper articles were caused by misconversion of temporal expressions such as “同年” (the same year) or “前月” (the previous month), into “YYYY-MM-DD” form, although temporal expressions were detected correctly. In contrast to newspaper articles, there are few temporal expressions in blogs for which resolution is required, such as “同日” (the same day) or “前月” (the previous month). As a result, recall and precision for blogs are higher than those for newspaper articles.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Extraction of Trend Information from Newspaper Articles: Hiroshima City University at NTCIR-7 MuST

Trend information is a summarization of temporal statistical data, such as changes in product prices and sales. We propose a method for extracting trend information from multiple newspaper articles. Our group participated in the T2 Subtask at TCIR-7 MuST (Multimodal Summarization for Trend Information). Our goal was to evaluate the effectiveness of our rule-based system using the data provided ...

متن کامل

Public anxiety and information seeking following the H1N1 outbreak: blogs, newspaper articles, and Wikipedia visits.

Web-based methodologies may provide a new and unique insight into public response to an infectious disease outbreak. This naturalistic study investigates the effectiveness of new web-based methodologies in assessing anxiety and information seeking in response to the 2009 H1N1 outbreak by examining language use in weblogs ("blogs"), newspaper articles, and web-based information seeking. Language...

متن کامل

Discoursal Analysis of Rhetorical Structure of an Online Iraqi English Newspaper

Abstract Rhetorical structure is helpful in improving how the writers maintain cohesion in their writings. This study examines how the Iraqi writers maintain cohesion in the text by analyzing the various rhetorical moves in Azzaman, an online Iraqi newspaper. To this purpose, twelve opinion articles from Azzaman Iraqi newspaper, published from January 2013 to June 2013 were analyzed. The findin...

متن کامل

Discoursal Analysis of Rhetorical Structure of an Online Iraqi English Newspaper

Abstract Rhetorical structure is helpful in improving how the writers maintain cohesion in their writings. This study examines how the Iraqi writers maintain cohesion in the text by analyzing the various rhetorical moves in Azzaman, an online Iraqi newspaper. To this purpose, twelve opinion articles from Azzaman Iraqi newspaper, published from January 2013 to June 2013 were analyzed. The findin...

متن کامل

Exploring History Through Newspaper Archives

This demo presents a web application which implements a pipeline for searching and browsing through newspaper archives. It uses a combination of information extraction, enrichment and visualization algorithms to help users to grasp a large amount of articles normally collected in archives. Illustrative results show appropriateness of the proposed pipeline for searching and brows-

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007